Missing data analysis: making it work in the real world.

نویسنده

  • John W Graham
چکیده

This review presents a practical summary of the missing data literature, including a sketch of missing data theory and descriptions of normal-model multiple imputation (MI) and maximum likelihood methods. Practical missing data analysis issues are discussed, most notably the inclusion of auxiliary variables for improving power and reducing bias. Solutions are given for missing data challenges such as handling longitudinal, categorical, and clustered data with normal-model MI; including interactions in the missing data model; and handling large numbers of variables. The discussion of attrition and nonignorable missingness emphasizes the need for longitudinal diagnostics and for reducing the uncertainty about the missing data mechanism under attrition. Strategies suggested for reducing attrition bias include using auxiliary variables, collecting follow-up data on a sample of those initially missing, and collecting data on intent to drop out. Suggestions are given for moving forward with research on missing data and attrition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

کاربرد متدولوژی ترکیبی تحلیل پوششی داده‌ها و ماتریس درجه ترجیح در ارزیابی واحدهای تصمیم گیری با رویکرد فازی

Data envelopment analysis (DEA) has been a very popular method for measuring and benchmarking relative efficiency of peer decision making units (DMUs) with multiple input and outputs. Traditional data envelopment analysis (DEA) models require crisp input and output data. In real world situations, however, crisp input and output data may not always be available, especially when a set of decision...

متن کامل

A method to solve the problem of missing data, outlier data and noisy data in order to improve the performance of human and information interaction

Abstract Purpose: Errors in data collection and failure to pay attention to data that are noisy in the collection process for any reason cause problems in data-based analysis and, as a result, wrong decision-making. Therefore, solving the problem of missing or noisy data before processing and analysis is of vital importance in analytical systems. The purpose of this paper is to provide a metho...

متن کامل

Classifying inputs and outputs in interval data envelopment analysis

Data envelopment analysis (DEA) is an approach to measure the relative efficiency of decision-making units with multiple inputs and multiple outputs using mathematical programming. In the traditional DEA, it is assumed that we know the input or output role of each performance measure. But in some situations, the type of performance measure is unknown. These performance measures are called flexi...

متن کامل

Probabilistic Linkage of Persian Record with Missing Data

Extended Abstract. When the comprehensive information about a topic is scattered among two or more data sets, using only one of those data sets would lead to information loss available in other data sets. Hence, it is necessary to integrate scattered information to a comprehensive unique data set. On the other hand, sometimes we are interested in recognition of duplications in a data set. The i...

متن کامل

 بررسی افزایش کارایی بیمارستان با استفاده از شبکه های هوشمند

  Background: Achieved results from this research shows that despite in health system in country efficiencies such as work force efficiency(doctors and nurses) is not incomputable  and percentage of bed occupation and regulating shift work program prepare manually and with paper, this issue results in time consuming of managers and wasting costs and errors in carried out computations. Therefor...

متن کامل

Design and Test of the Real-time Text mining dashboard for Twitter

One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in dataset that is currently being produced at high speed, large volumes and with a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing the huge amount of data, today has become one of the concerns of data science scholar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Annual review of psychology

دوره 60  شماره 

صفحات  -

تاریخ انتشار 2009